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Abstract.  A  conjecture  on  the  convergence  of  diffusion  models  in 
population  genetics  to  a  simple  Markov  chain  model  is  proved.  The  notion 
of  bi-generalized  diffusion  processes  and  their  limit  theorems  are  used 
systematically  to  prove  the  conjecture.  Three  limits;  strong  selection  - 
weak  mutation  limit,  moderate  selection  -  weak  mutation  limit,  weak 
selection  -  weak  mutation  limit  are  considered  for  typical  diffusion 
models  in  population  genetics. 

Key  words:  Diffusion  model  -  Strong  selection-weak  mutation  limit  - 
Markov  chain  model  -  Bi-generalized  diffusion  process 


1 .  Introduction 


In  population  genetics  theory,  stochastic  models  are  described  by 

discrete  time  stochastic  processes  and  they  are  approximated  by 

appropriate  continuous  time  stochastic  processes  (see  Iizuka  and  Matsuda 

1982;  Iizuka  1987  and  the  cited  therein).  In  many  cases,  these 

continuous  time  stochastic  processes  are  diffusion  processes  and  are 

referred  to  as  diffusion  models  in  population  genetics  (Crow  and  Kimura 

1970;  Ewens  1979).  Let  x^(t),  i=l,2,...,K;  tiO  be  the  gene  frequency 

of  the  i-th  allele  at  time  t  in  a  population.  In  the  diffusion  model, 

x( t )=(Xj ( t ) ,Xg ( t ) . x^_^(t))  is  a  K-l  dimensional  diffusion  process 

K—  1 

with  x^tJiO,  i=l,2,...,K  and  =  ji  ( t  )i  1 .  The  boundary  condition, 

when  it  is  necessary,  is  usually  the  reflection  boundary  condition.  The 
infinitesimal  generator  of  this  diffusion  process  is 


(1.1)  L  =  \ 

where  x=(x^,x 
example,  the  d 


K-l 


2 , . . . , i )  with 

iffusion  terms,  a.. 

^  %j 


+  2  b • (x,o , 0 ) a®  , 

i=l  1  dXi 

x^sO,  i  =  l,2 . K-l 

(x,a)  and  the  drift 


and  .  For 

terms,  b^(x,a,0)  are 


2 


specified  in  Gillespie  (1989)  (see  Eq.  (1)).  The  parameter  a  denotes  the 
effect  of  selection  and  0  denotes  the  effect  of  mutation.  From  now  on, 
the  diffusion  processes  satisfying  the  above  conditions  are  called  the 
population  genetical  diffusion  processes.  For  these  diffusion  processes 
in  the  case  of  Ki3  in  general,  it  is  difficult  to  obtain  the 

biologically  interesting  quantities  such  as  the  average  homozygosity, 

K  2 

E[2i=ixi(t)],  the  stationary  distributions  if  exist  and  so  on.  The 
exceptions  are  the  case  of  no  selection  and  the  case  of  pure  stochastic 
selection  without  mutation  and  random  sampling  drift.  The  reason  why  it 
is  difficult  to  obtain  the  stationary  distributions  is  that  population 
genetical  diffusion  processes  are  not  time  reversible  in  many  cases  (in 
physics  term,  there  is  no  "flux  zero  solution"  for  the  Kolmogorov  forward 
equation).  The  numerical  analysis  for  the  diffusion  equation  or  the 
computer  simulation  for  the  corresponding  population  genetical  diffusion 
process  may  be  applicable,  but  these  methods  are  again  difficult  to 
perform  in  the  case  where  K  is  large. 

Gillespie  (1983a  and  1983b)  proposed  the  following  conjecture  in 
which  K-l  dimensional  population  genetical  diffusion  processes  with 
selection,  mutation  and  the  random  sampling  drift  are  approximated  by 
simple  Markov  jump  processes.  Let  XE  ( t  )  =  (XE  ( t ), XE  ( t ),..., xjjt_^  ( t ) )  be 
a  K-l  dimensional  population  genetical  diffusion  process  with  the 
infinitesimal  generator 


(1.2) 

Le 

l  K_1  s2  K-l 

“  2  i  ?=1aij  (x,0(8)8xi8Xj  +  i|1bi(x,ote,0e)5x“ 

where 

8>0 

and  a„,0„>O. 

8  8 

Conjecture 

(Gillespie) 

Assume  that  a_-*+®  and  0_-*O  as  e-*0  where  a„0„  is  fixed.  Then 
8  8  8  8 

{ Xs  ( t )  }  converges  weakly  to  a  Markov  chain  as  e-*0. 

In  this  conjecture,  the  limiting  Markov  chains  are  characterized  as 
follows.  Assume  that  there  exist  two  limits  A^  j  (  x )  =  1  ime_>gai  j  ( x  ,ag  ) /a£ 
and  B . (x) =1 im_  ~b . ( x ,a_ , 0_ )/a_ .  (1)  The  case  of  A..(x)=0  for 

1  g-MJ  IEEE  1 J 
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i . j=l , 2 , . . . ,K-1 .  The  state  space  of  the  Markov  chain  is  {x;B^(x)=0, 
i  =  l ,  2  ,  .  .  .  ,  K-l } ,  that  is,  a  set.  of  the  equilibrium  points  for  a 
deterministic  motion  (x)dg^t)  .  (2)  The  case  of  A^j(x)>0  for 

some  i  and  j.  Assume  that  there  exist  the  stationary  distributions  for 
a  diffusion  process  with  the  infinitesimal  generator 


9“  ^K-l, 

~^j 

chain  is  the  set  of  the  statio.  ary  distributions  for  this  diffusion 


t  =  2  ^  j  =  lAij(x)  §x  5x  7+2^[=}Bi  (x)g^— .  The  state  space  of  the  Marko 

i  J  i 


process,  that  is,  the  limiting  Markov  chain  is  a  measure  valued  Markov 
chain. 

By  the  following  proposition,  however,  the  conjecture  of  Gillespie 
never  holds  in  the  sense  of  weak  convergence  in  the  Polish  space  D  with 
the  Skorohod  topology  (Billingsley  1968). 

Proposition 

Let  {y  (t) } ,n=l ,2 , . . . .  be  a  sequence  of  diffusion  processes.  The 
{ yn ( t }  does  not  converge  weakly  in  D  to  a  jump  process. 

Proof .  Assume  that  (yn(t)}  converges  weakly  to  y(t)  in  D.  Let  Pn 

and  P  be  the  induced  probability  measures  on  D  by  yn(t)  and  y(t), 
respectively.  The  space  of  the  continuous  functions  C  is  a  closed 
subset  of  D  (Billingsley  1968).  By  weak  convergence  of  { yn ( t ) }  to 
y(t)  in  D,  we  have  P(C)ilimsup  +00P  (C)  =  l.  This  means  that  y(t)  is 
continuous  with  probability  one.  □ 


By  this  proposition,  the  conjecture  of  Gillespie  does  not  hold  in  the 
sense  of  weak  convergence  in  D.  There  is,  however,  a  possibility  that 
the  conjecture  holds  in  a  weaker  sense  of  convergence  such  as  the 
convergence  of  the  finite  dimensional  distributions.  Indeed,  Kipnis  and 
Newman  (1985)  proved  the  convergence  of  one-dimensional  diffusion 
processes  related  to  the  metastable  behavior  in  statistical  physics  to  a 
jump  process  in  the  sense  of  the  finite  dimensional  distributions.  In 
their  proof,  however,  the  constancy  of  the  diffusion  terra  is  essential  and 
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hence  not  applicable  to  the  conjecture  of  Gillespie. 

It  seems  us  that  the  proof  of  the  conjecture  of  Gillespie  for  Ki3  is 

difficult.  The  results  of  Fleidlin  and  Wentzel  (1984)  are  closely 

related  to  this  problem  but  their  results  are  rather  abstract  and  do  not 

present  limit  theorems  corresponding  to  the  conjecture  of  Gillespie.  Even 

the  case  of  K=2,  there  are  several  questions  about  the  conjecture.  First 

of  all,  does  the  conjecture  hold  in  the  sense  of  the  convergence  of  the 

finite  dimensional  distributions?  What  is  the  appropriate  time  scaling  to 

obtain  a  limiting  jump  process  (though  Gillespie  does  not  mention  about 

this)?  What  is  the  relation  between  a_  and  8  to  obtain  a  limiting 

&  & 

jump  process?  What  is  the  characterization  of  the  limit  process? 

In  this  paper,  we  will  consider  the  case  of  K=2.  Even  the  case  of 
K=2,  this  is  an  interesting  problems  for  the  limit  theorems  in  the  theory 
of  stochastic  processes.  For  example,  this  is  a  new  type  of  limit 
theorem,  that  is,  from  the  processes  with  continuous  paths  to  a  jump 
process.  Further,  the  answers  for  the  above  questions  will  be  hints  for 
the  proof  of  the  conjecture  in  the  case  of  Ki3. 

The  traditional  methods  to  prove  the  limit  theorems  are  the  semigroup 
approximation  theorems  (for  example,  see  Kurtz  1975)  and  the  martingale 
methods  (Stroock  and  Varadhara  1979).  These  methods,  however,  are  not 
applicable  to  the  present  problem.  Indeed,  the  coefficients  of  the 
infinitesimal  generator  have  no  limit  in  some  cases  as  we  will  see  later. 

Recently,  one  of  the  authors  proposed  a  class  of  one-dimensional 
stochastic  processes  called  the  bi-general ized  diffusion  processes  and 
proved  some  limit  theorems  for  this  class  of  processes  (Ogura  1989).  We 
will  use  this  method  systematically  in  this  paper.  In  §2,  we  will  review 
the  results  on  the  bi-general ized  diffusion  processes.  We  will  prove  two 
limit  theorems  here,  since  the  general  results  of  Ogura  (1989)  do  not 
cover  some  cases  in  this  paper.  The  conjecture  of  Gillespie  for  the  case 
of  K=2  will  be  described  explicitly  in  §3.  Here,  we  will  consider  not 
only  the  cases  of  a  -*+»  and  8  ->0  but  also  the  cases  of  a  -*a(0s:a<+«') 
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and  6g-*0.  In  §4  the  results  on  the  cases  introduced  in  §3  will  be 
obtained  using  the  limit  theorems  presented  in  §2.  In  §5  we  will 
consider  the  other  time  scaling  to  obtain  a  non-trivial  limit  process 
since  the  time  scaling  in  §4  leads  to  a  trivial  limit  in  some  cases.  We 
will  discuss  the  results  obtained  so  far  in  §6. 

2.  Bi-general ized  diffusion  processes  and  limit  theorems 


2.1.  Feller’s  canonical  form  for  one-dimensional  diffusion  processes 
Let  x(t),tiO  be  a  one-dimensional  diffusion  process  (ODDP  for 
brief)  on  the  state  space  ( fl  ,  r)  ,  -oo£fl.<r£+°o ,  with  the  infinitesimal 
generator 


(2.1)  L  =  ^  +  b<x)^ 

and  given  boundary  conditions  if  they  are  necessary.  The  representation 


(2.1)  is  referred  to  as  Kolmogorov’s  canonical  form  and  this  is  equivalent 


to 

(2-2)  L  =  asfrj-  Hifrj-’ 


(2.3) 

s(x) 

=  f"  e-B<^dy, 

J*o 

(2.4) 

m(x) 

■  II 
x0 

where 

B(x) 

=  2j^y)dy,  x0  is  a  constant  (£<x0<r)  and  Aixir.  The 

representation  (2.2)  is  referred  to  as  Feller’s  canonical  form.  The 


continuous  and  strictly  increasing  functions  s(x)  and  m(x)  are 


referred  to  as  the  scale  function  and  the  speed  measure,  respectively. 
Mote  that  we  can  replace  a  pair  s(x)  and  m(x)  by  s^(x)=cs(x)  and 
m j ( x ) =m ( x)/c ,  where  c  is  a  positive  constant.  Conversely,  extending 
the  two  functions  s(x)  and  ra(x)  on  (£,  r)  to  R  =  (-oo,+<») 
appropriately  according  to  the  nature  of  the  boundary  points  £  and  r, 
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s(x)  and  ra(x)  which  are  continuous  and  strictly  increasing  on  (£, r) 
determine  a  unique  ODDP  on  (A,r). 

2.2.  Generalized  diffusion  processes 

Feller  (1959)  proposed  a  class  of  one-dimensional  stochastic 

processes  including  ODDPs  by  means  of  weakening  the  conditions  on  m(x). 

Let  s(x)  be  a  continuous  and  strictly  increasing  function  on  (fl.,r)  and 

m(x)  be  a  right  continuous  nondecreasing  function  on  (11,  r).  Then  the 

pair  s(x)  and  m(x)  (after  their  extension  to  R)  determine  a  unique 

strong  Markov  process.  This  class  of  stochastic  processes  are  referred 

to  as  the  generalized  diffusion  processes  (GDPs  for  brief).  For  example, 

a  birth  and  death  process  is  a  GDP.  Let  x  (t)  be  a  GDP  with  the  scale 

n 

function  sfi(x)  and  the  speed  measure  m  (x) ,n=l , 2 , . .  .  .  We  may  expect 

limit  theorems  where  the  convergence  of  (sn(x)}  and  (mn(x)}  implies 

the  convergence  of  a  sequence  of  GDPs  (xn(t)}  in  some  sense,  for  example 

the  sense  of  the  convergence  of  the  finite  dimensional  distributions.  The 

limit  function  s(x)  of  {s  (x)),  however,  is  not  continuous  nor  strictly 

n 

increasing  in  general.  In  other  words,  the  class  of  GDPs  is  not  suitable 
to  consider  the  limit  theorems.  For  this  reason,  it  is  necessary  to 
extend  the  class  of  GDPs  by  means  of  defining  a  new  class  of 
one-dimensional  stochastic  processes  by  a  pair  s(x)  and  m(x)  that  are 
not  continuous  nor  strictly  increasing  in  general. 

2.3.  Bi-general ized  diffusion  processes 

Recently,  Ogura  (1989)  extended  the  notion  of  a  generalized  diffusion 
process  to  that  of  a  bi-general i zed  diffusion  prooess  (BGDP  for  brief), 
which  is  a  Markov  process  corresponds  to  a  nondecreasing  function  s(x) 
and  a  right  continuous  nondecreasing  function  m(x) .  In  the  remaining  of 
this  section,  we  summarize  his  definition  and  the  analytical  construction 
of  BGDPs,  and  then  give  two  limit  theorems  for  the  BGDPs ,  which  are  slight, 
extension  of  those  ibid. 
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2.3.1.  Definition  and  analytical  construction  of  the  bi-general i zed 


diffusion  processes 

In  this  subsection,  we  will  review  the  results  in  Ogura  (1989)  on  the 
construction  of  BGDPs  and  their  stopped  motions.  Let  2*;+oo, 

Q=(2^,22),  §=[2i>22]  arK*  consider  two  nondecreasing  functions  s,  m  on  Q. 
Since  we  will  only  be  concerned  with  the  Stieltjes  measure  dm  induced 
by  m,  we  assume  that  m  is  right  continuous.  Under  some  mild 
assumptions,  we  can  construct  a  unique  Markov  semigroup  on  Q  associated 
to  s  and  m  in  the  following  way  (for  the  rigid  statement  and  its 
justification,  see  ibid.). 

Let  dm  the  image  measure  of  dm  by  the  map  s: 

(2.5)  f  qU)dmU)  =  f  q°s(x)dm(x),  q  e  C(s(Q)), 

Js(Q)  JQ 

where  C(S)  is  the  space  of  all  bounded  continuous  functions  on  a  set  S. 

We  want  to  define  a  GDP  X  associated  to  the  speed  measure  din  and  the 

natural  scale  s(fj)=£.  In  order  to  do  so  we  must  settle  the  state  space 

and  the  boundary  conditions  when  they  are  necessary.  Exploiting 

Watanabe’s  convention,  however,  we  can  carry  over  the  boundary  conditions 

to  the  state  space  by  extending  it  and  by  setting  the  trap  conditions  at 

the  end  points  (see  Watanabe  1975  and  Minami  et  at.  1985  for  detail). 

Thus  we  set  (3=(2lt22),  §'=[2i>22]  and  /§\s  (Q)dra(g  )=0 ,  where 

[-oo.inf  s(Q)]  and  22e[sup  s(Q),+<»j.  Notice  that  2^,  i  =  1 , 2  are 

uniquely  determined  from  s  and  ra  in  Ogura  (1989)  by  the  convention 

there.  More  precisely,  let  E=[-oo,+a>]  and  extend  s  and  m  to  the 

functions  E-*E  so  that  i)  at  least  one  of  s  ( Jl  2 )  [sOl^)]  and  m(22) 

[m(2^-)]  is  equal  to  +«>  [resp.  -» ]  ,  ii)  is  not  a  common  jump  point  of 

s  and  m.  In  the  above  and  the  following  we  denote  s(a+)=lim  s(x)  and 

X4-  a. 

s  (a- )  =  1  imxT&s  ( x)  .  We  then  set  22  =  s(fi2-)  if  m(22)  =+°°,  22  =  +a>  if 
m(22)<+°°,  and  similarly  for  2^.  We  also  note  that  the  image  measure  dm 
is  induced  by  the  nondecreasing  function  m  given  by 
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SU) 


if  {  }  *  0, 
if  {  }  =  0. 


,  sup{m(x) :s(x)i$}, 

l  m(-«) , 

We  now  have  the  Green  function  H(a,£,r))>  a>0,  and  the 

transition  density  q(t  ,%,n),  t>0,  associated  to  dm  (and  2})  by 

the  standard  method,  which  we  give  in  the  following  for  completeness.  Fi 
a  continuity  point  5q£§  °f  ®»  anc*  <p2(£,cx)  be  the 

solutions  to  the  following  integral  equations 

V1(€.a)  =  1  +  al  U  -  n )v1(n ,a)dm(n ) ,  %  e  Q, 

*0  + 

U  -  n)vo(n, a)dm(n) ,  ?  e  9, 

+  * 

where  /|^+f  (n  )dm(rj  )=/^^  ^  j  f  (n  )dm(n  )  if  ?e[?0,J2],  =-/^  ^  j  f  (n  )dm(rj ) 
if  5s[2^,5q).  Then,  for  each  a>0,  there  exist  the  limits 

h,  (a)  =  -lira  v,  U  ,a  )/<s  ,  U  .  a )  , 

1  2  1 

h9 (a }  =  lim  <p9U  ,a)/v1  U  ,o) , 

^  ?t22  z  1 

where  we  use  the  usual  convention  l/»=0,  (±a)/0=±<»  for  positive  a. 
Define  the  function  h(a),'  a>0  and  UjU.o),  i  =  l,2,  ot>0  by 

l/h(«)  =  1/h^a)  +  l/h2(a), 

u^S.a)  =  <pjU»oO  +  (-l)i  +  1v2(^,a)/hi(o()  . 

It  is  then  known  that  (g  ,a)  [u2  (€  >cc)  ]  is  positive  and  nondecreasing 
[resp.  nonincreasing]  in  £  and  u^ (?q ,a ) =u2 (|q ,a ) = 1 .  Further,  their 
Wronskian  is  equal  to  l/h(a).  We  thus  obtain  the  associated  Green 
function  H(a,t‘,n)  and  the  transition  density  by 

H(o,5»h)  =  H(a,n»?)  =  h(a)Uj(^ ,a)u2(n ,a) ,  ^  ^  n  <  22, 


V2(g,a)  =  $  -  $0  +  «T 
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f  e-atq(t,$,n)dt  +  <J>U. n)  =  H(a,Z,n),  a  >  0,  5.n  e 
J0 

for  some  function  <J> ( 4 , n)  independent  of  a.  The  precise  formula  of 
<J>(I, n)  is  given  in  Minami  et  al .  (  1985),  but  we  do  not  use  it  here. 

Now  the  semigroup  for  the  pair  (s,m)  is  defined  by  the  formula 

(2.6)  f  e“atT+ f  (x)dt  =  f  H(a  ,  s  (x)  ,  s  (y) )  f  (y)dra(y)  ,  a>0,  xeQ,  fGB(Q)  , 

J  0  L  J  Q 

and  the  semigroup  for  the  stopped  motion  by 

(2.7)  r+”e_0(tT;f(x)dt  =  P  H(a,s(x),s(y))f(y)dm(y) 

J  0  Z  J  Q 


u,(s(x),a)  u. (s(x) ,a) 

+  f(ft2)au1T42-,«T  +  fUl)au1H1-,a)  -  a  >  0,  x  e  U,  f  6  B(Q)  , 

where  u(2^+  ,a)  =  lim^ij  u(£,a)  and  u(J2“  .a)  =  lim^tj  u(£,a)  and  B(S)  is 

1  2 

the  space  of  all  bounded  Borel  measurable  functions  on  S.  Notice  that 

T.f(x)  =  f  q(t,s(x) , s(y) )f (y)dm(y) , 

Z  “  Q 

Tif(x)  =  f  q(t,s(x) ,s(y))f(y) dm ( y ) 
z  JQ 

+  f  (fi2)J‘^qj^(x,s(x)  )dr  +  f  )  J^qj  ^  (x  ,  s  (x)  )  dx  , 

where  qj  (x,s(x))[qj  (r,s(x))]  is  the  inverse  Laplace  transform  of  the 

function  Uj ( s (x ) ,a )/u1 (S2~ , a )  [resp.  u2 ( s (x ) ,a )/u2 (Jj+ ,a ) ] .  Although  the 
semigroups  and  are  not  strongly  continuous  in  general,  there 

exist  Markov  processes  X= ( x ( t ) , e^ , Px)  on  Q  and  X' = (x ' ( t ) , P^)  on  Q 
such  that 


Ex[f(x(t)) ;eA>t]  =  Ttf(x) 


t>0,  xeQ,  feB(Q), 


E'(f(x*(t))]  =  T’f(x), 


t>0,  xeQ,  feB(Q), 


where  Ex  [ ]  stands  for  the  expectation  with  respect  to  Px[resp.  P^] 
(strictly  speaking  Q  is  joined  with  2^  or  22  whenever  they  are 
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reflecting  point).  We  call  the  process  X* = (x‘ ( t ) , P^)  a  stopped  BGDP  or 
briefly  a  SBGDP. 

2.3.2.  Limit  theorems  for  a  sequence  of  bi-general ized  diffusion 
processes 

The  question  whether  the  convergence  of  the  sequence  of  sfi  and 
implies  the  convergence  of  that  of  the  associated  processes  is  now  on 
stage.  Throughout  this  subsection,  we  adopt  the  convention  in  the 
previous  subsection  and  assume  that 

lim  sn(x)  =  s(x),  x  e  R, 

(2.8)  n-*°° 

lim  m  (x)  =  m(x),  x  e  Rsj(m), 

n-><» 

where  J(m)  is  the  set  of  jump  points  of  m.  As  is  shown  in  Ogura 
(1989),  if  J ( s )nJ (m)nQ=0,  then  the  induced  measures  dmn  converge  to 
din,  so  that  the  finite  dimensional  distributions  of  the  associated 
processes.  However,  this  is  false  in  general  if  the  condition 
J ( s )nJ (m)nQ-0  fails.  Furthermore,  in  the  following  applications,  we 
actually  encounter  with  the  case  of  J ( s )nJ (m)nQ^0,  to  which  we  now 
advance . 

Let  X(n)=(x(t) ,eA,p£n) )  on  Q(n)  and  X’ ( n) = (x * ( t ) , P^ ( n ) )  on 
the  BGDPs  and  SBGDPs  associated  to  sn  and  mn .  The  corresponding 
semigroups  are  denoted  by  T^n^  and  T^n^  respectively.  Also  the  space 
of  continuous  functions  on  Q  with  compact  support  is  denoted  by  Cq(Q). 
Theorem  2.1.  In  addition  to  the  condition  (2.8),  assume  that 

(2.9)  lim  in  ( O  =  mU),  ?  e  R\J(m), 
n-»<» 

(2.10)  lim  q°s(x)dm(x)=  q°s(x)dm(x), 

n-oo  J  [a,b]  n  n  J  [  a ,  b] 

a , b  e  ( J(s)uJ(m) )cnQ,  q  e  C(§). 

Then  for  every  x ^ »  •  ■  • » x^>0 ,  f ^ , f g  ,  .  . • , f^eCp (Q) ,  NeN ,  we  have 
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(2.11) 


lim  T^n)(f  T^n)(.  . .  (f  x(n) f  )...)) (x) 
n-»°o  T1  1  x2  N  1  XN  N 

=  TXi(fiTT2(  •  •  •  (fN-JLTTNfN)  '  '  ‘  )  >  (x)  *  x  e  Q. 

Remark  l)Since  the  conditions  (2.9)  and  (2.10)  are  automatically 
satisfied  in  the  case  of  J ( s )nJ (m)nQ=0,  we  use  Theorem  2.1  in  the  later 
without  mentioning  them  in  that  case. 

2)The  condition  (2.9)  does  not  seem  to  imply  (2.10)  in  general.  Indeed, 
in  view  of  the  relation 


I. 


s  *( (bj ,b2] ) 


•sn(x)dmn(x) 


*  I 


(bj  , b2  ] 


q(OdmnU) 


in  Ogura  (1989),  we  can  not  expect  much  information  on  (2.10)  if  a  or  b  is 

the  flat  point  of  s  or  s. 

n 

Proof  of  Theorem  2.1.  We  will  show  (2.11)  only  for  N=l.  The  outline  of 
the  proof  is  same  as  that  in  Ogura  (1989).  Indeed,  as  is  shown  there, 

(2.9)  implies  that  limn_tooqn(  t ,  ^  ,  r» )  =q(  t ,  ^  ,  n  )  uniformly  in  K  for 

every  compact  set  K  in  <Sj.  Hence  we  have 

lira  I JQqn(t,sn(x) ,sn(y) )f (y)dmn(y)  -  J^q( t , sn(x) , sn (y ) ) f ( y ) dmn ( y ) |  =  0. 

Further,  since  q(t,5,/j)  is  uniformly  continuous  in  5,rjeK,  we  have 

lim  |f  q(t,s  (x),s  (y))f(y)dm  (y)  -  f  q( t , s (x) , s  (y ) ) f ( y ) dm  (y ) |  =  0. 
n-*oo  JQnn  n  Jq  n  n 

Thus  the  assertion  follows  from  the  next  Lemma.  □ 

Lemma  2.1.  Under  the  assumption  of  Theorem  2.1,  it  holds  that 

(2.12)  lim  f  q°s  (x)f(x) dm  ( x ) =  f  q° s (x) f (x)dm(x) ,  qeC(§),  feCn(Q). 

n-*»  JQ  n  n  Jq  u 

Proof .  Denote  the  support  of  f  by  sppt(f)  and  fix  an  e>0.  Then,  due 
to  the  uniform  continuity  of  f,  we  can  find  a  0<5ie  such  that 
I f (x) -f (y) I <s  for  all  x.yesppt(f)  with  |x-yl<25.  Also  we  may  assume 
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that  lq(5)-q(n) l<E  for  all  x.yesppt(f)  with  !x-yl<2a.  Let  J5(s)={xeQ: 
s(x+)-s(x-)i5) •  It  then  follows  that 

lim  sup{ I q® s  (x) -q° s ( x) I : xeK\Js ( s ) }  £  e  for  compact  KcQ. 
n-*»  n  0 

Vr 

Denote  (s)nsppt(f )  =  {x1<X2<*  *  *  <xk^  an(^  let  U=ui  =  i*i*  *i=(ai*ki)  where 
A  i<ai<x  i<t>i<  *  •<ai<xi<bi<*  *<xk<'3k<^2  ’  xi_ai<&/2,  b^-x^<6/2  and  all  a.^ 
and  tK  are  continuity  point  of  s  and  m.  We  then  divide  the  desired 
difference  as 


lim  1 1 1 1  I  a  28110^ 
n-*<»  n 


lim  mn(sppt(f ) ) . 
n-»°° 


Further,  it  is  easy  to  show  that 


lim  I IV  |  a  2ellfllan  lim  ron(sppt ( f ) ) , 

n-*oo  n-»oo 

since  the  integrand  is  continuous  modulus  ell  f  II  . 

00 

Further , 


and 
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In(  i )  £  I  q°sn(x)  (f  (x)~f  (xi)  )dmn(x)  | 

+  q° sn(x)dran(x)-  q°s(x)dm(x) ) | 


The  first  and  the  third  term  is  dominated  by  esup{ I q°  sn( x) I :  xel ^ }mn( I ^ ) 
and  esup{ | q® s (x) I : xel^ }m( I i )  respectively.  Further,  from  (2.10),  the 
second  term  goes  to  zero  as  n-*« ,  so  that 

lim  1 1  |  «£  e  sup{  I  q°  s  (x )  I  :  xesppt  ( f )  }m(U)  . 

n-*oo 

Since  e>0  is  arbitrary,  we  obtain  (2.12).  □ 

Remark  In  view  of  the  above  proof,  (2.10)  can  be  replaced  with  a  weaker 
condition:  for  each  xeJ ( s )nJ (m)nQ  and  S>0  there  exists  an  interval 
I=(a,b)cQ  such  that  x~5<a<x<b<x+5 ,  a,b  6  ( J (s)uJ (m) ) cnQ  and  (2.10)  holds. 

As  is  noted  in  Ogura  (1989),  Theorem  2.1  does  not  assure  the 
convergence  of  the  finite  dimensional  distributions,  since  fjceCQ(Q)  is 
assumed.  However,  it  is  not  difficult  to  trace  the  proof  there  to  obtain 
the  next  theorem. 

Theorem  2.2.  Under  the  assumption  in  Theorem  2.1,  it  holds  that 
(2.13)  lim  P^n^(x(t1)  >  a1,...,x(tN)  >  aN) 

n-*<» 

=  Px(x(t1)  >  a1,...,x(tN)  >  aN), 
for  every  NeN ,  0  <t jCtgC- • - < t N ,  , a2 , • • • , a^eQNJ (m)  and  xeQ. 

3.  Population  genetics  models 

Let  xe ( t )  be  a  ODDP  on  [0,1]  with  the  infinitesimal  generator 
(3.1)  Le  =  ia£(x)-^  +  be<x>a!- 
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where  a_(x)>0  (0<x<l),  a_ (0)=a_ ( 1 )=0  and  b  (0)i0,  b_(l)iO.  The 

&  w  O  O  O 

concrete  functional  form  of  a  (x)  and  b  (x)  will  be  specified  below  in 

&  s 

(3.10).  The  ODDP  is  uniquely  determined  by  a  (x) ,  b  (x)  and  the 

E  & 

boundary  conditions  if  they  are  necessary.  If  the  boundary  0  or  1  is 
regular,  then  we  assume  the  reflection  boundary  condition  as  is  usually 
done  in  population  genetics  theory.  For  a,be[0,l],  we  define 


(  I?(a,b)  =  |  e 


pb  -B„<x) 


(3.2) 


e 


dx 


e  l>b  9  ( X ) 

l  I*(a’b)  =  Ja  a^TxT®  d* 

where  BE(x)=2j{be(x)/ag(x) }dx,  and 
r  i?(0,x)/i®(0,i) 


(3.3)  s6(x)  = 


(0  £  X  £  1) 

(otherwise)  , 


(3.4) 


mE(x) 


'  0 

l®(0,x)l!(0,l) 
.  I®(0,l)lf(0,l) 


(x  £  0) 

(0  <  x  i  1) 
(x  >  1). 


The  se(x)  and  me(x)  are  the  scale  function  and  the  speed  measure  of 
the  ODDP  xe(t),  respectively.  The  ODDP  xe(t)  is  uniquely  determined 
by  the  pair  (se (x) ,ms (x) ) . 

We  introduce  a  time  scaled  process 


(3.5)  Xg(t)  =  xeUet) 
for  xe(t),  where 

(3.6)  x&  =  I?(0,l)lJ(0,l). 

The  X  (t)  is  a  ODDP  with  the  scale  function  s  (x),  the  speed  measure 

S  E 

m  (x)  and  the  infinitesimal  generator  L 
E  & 
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(3.7)  s£(x)  =  se(x) 

'  0  (x  *  0) 

(3.8)  ®e(x)  =  me(x)/A.e  =  ■  if (0,x)/I®(0, 1)  (0  £  x  a  1) 

.1  (x  *  1) 

(3.9)  Le  =  xeLe. 

In  the  following,  we  consider  the  functional  form  of 
f  »e(x)  =  x(l  -  x)  +  caex2(l  -  x)2 

(3.10) 

bg (x)  =  ctgX ( 1  -  x){cfi(l  -  2x)  +  (1  -  c)g(x)}  +  0g ( 1  -  rx), 

where  B.ril,  c=0  or  c=l  and  8UPoixil 1  1  <+t>0 •  Let  G(x)  be 

G(x)=/g(x)dx  with  G(x)i0  (Osxil).  This  functional  form  of  (3.10)  covers 
many  models  in  population  genetics  as  we  see  in  the  following.  The 
population  genetical  meaning  of  the  ODDP  xe(t)  is  the  gene  frequency  of 
the  allele  Aj  in  the  population  at  time  t.  The  l-xe(t)  is  the  gene 
frequency  of  the  allele  Ag  in  the  population  at  time  t.  Here,  we  are 

considering  a  diploid  model  with  the  two  alleles,  Aj  and  .  The  term 

x(l-x)  in  a  (x)  denotes  the  effect  of  random  sampling  drift  (random 
effect  due  to  the  finiteness  of  population  size).  The  term  0  (1-rx) 

w 

in  b  (x)  denotes  the  effect  of  mutatiom.  Let  u  (resp.  v)  be  the 
& 

mutation  rate  from  Ag  (resp.  A^ )  to  Aj  (resp.  Ag).  The  parameter  ril 

is  r=l+v/u.  In  the  case  of  symmetric  mutation  (u=v),  r=2.  In  the  case 

of  one  way  mutation  from  Ag  to  A^ ,  r=l.  The  parameter  0£  corresponds 

to  2Nu,  where  N  is  the  (effective)  size  of  population.  The  remaining 

terms  in  a  (x)  and  b  (x)  denote  the  effect  of  selection.  Let  S  be  the 

6  & 

order  of  selection.  The  parameter  ag  corresponds  to  2NS.  The  case  of 
c=l  is  stochastic  selection.  In  the  stochastic  selection  model,  the  case 
of  6>1  is  the  SAS-CFF  model  proposed  by  Gillespie  (1978).  The  case  of 
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8  =  1  in  the  stochastic  selection  model  is  the  TIM  model  proposed  by 
Takahata  et  al .  (1975).  The  case  of  c=0  is  deterministic  selection. 

Two  typical  examples  of  g(x)  are  g(x)=l  (genic  selection)  and  g(x)=l-2x 
(overdominance) . 

The  conjecture  of  Gillespie  in  §1  in  the  case  of  K=2  is  to  consider 
the  limit  of  the  sequence  of  ODDPs  { xe ( t ) )  (after  an  appropriate  time 
scaling)  with  a_-*+«  and  8.  0.  In  this  paper,  we  consider  three  cases 
including  the  above.  (i)  a  -*+»  and  0_-*O.  (ii)  ce_-*a(0<ot<+<»)  and  0_-*O. 

o  D  c  E 

(iii)  a£-*0  and  0g-»O.  These  three  limits  are  referred  to  as  the  strong 
selection  -  weak  mutation  limit  (SSWM  limit),  the  moderate  selection  - 
weak  mutation  limit  and  the  weak  selection  -  weak  mutation  limit, 
respectively.  In  the  next  section,  we  condider  the  following  seven  cases 
with  the  time  scaling  x_  introduced  above.  (1)  ot_-*a  (Oia<+“) ,  0_-»O, 

b  £  D 

c=0 .  (2)  ae-*+oo,  0g-*O,  c=0 ,  r=2 ,  g(x)  =  l.  (3)  ae-»+°°,  0g-*O,  c=0,  r=2, 

g(x)  =  l-2x.  (4)  a_-*0,  0_-*O ,  c=  1 ,  r=2 ,  8il.  (5)  a_-»a  (0<a<+»),  0_-*O,  c=  1 , 

EE  EE 

r=2,  8il.  (6)  a  _-•+«> ,  0_-»O ,  c=l ,  r=2,  3>1.  (7)  a  _-*+<» ,  0  -*0,  c=  1 ,  r=2 ,  8  =  1. 

EE  EE 

These  seven  cases  are  important  examlpes  in  population  genetics. 

4.  Results  on  population  genetics  models 


We  have  the  following  results  using  the  limit  theorems  in  §2.  Let 
X  (t,xn)  be  X  (t)  defined  above  with  the  initial  condition  X  (0)=xn 
where  xQ  is  a  constant  (0£Xq£1). 


(1)  Deterministic  selection,  weak  or  moderate  selection  -  weak 
mutation  limit  (a _-*a;  0ia<+®,  0_-*O,  c=0) 

E  E 

For  simplicity,  we  consider  the  case  of  r*l.  Define 
(s (x) ,m(x) )  by 


(4.1)  s(x) 


r  jV2“G(y  W|V2“0(y)<iy 


(0  i  X  £  1 ) 
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I 


X 


(otherwise) 


(4.2)  m(x) 


r  0 

{1  +  1  i  e2a [G( 1 ) — G ( 0 ) ] } 

.  1 


(x  <  0) 

(0  *  x  <  1) 
(x  *  1) , 


where  G(x)  is  defined  in  the  previous  section.  Then,  we  have  s  (x) 

£ 

-♦s(x)  for  every  x  and  me(x)-*m  (x)  for  every  x  with  x*0.  Further, 
for  r*l,  we  have 

(4.3)  a6  -  i-|e2“°«»  *  — i-r  e2“0<l))J>1e'2“G<y)dy, 

£  0 

where  a~b_  means  a  /b  -*1  as  e-*0.  Note  that  Q=(-a>,+«),  J(s)=0, 

c  o  C  £ 

J (m) ={ 0 , 1 }  and  J(s)nJ(m)nQ=0.  By  Theorem  2.2,  (Xe(t,x0)}  converges  to 
a  Markov  jump  process  X(t,Xg)  for  t>0  in  the  sense  of  the  finite 
dimensional  distributions,  where 


(4.4) 


(  P(X(t,x0)  =  0)  =  p  +  (1  -  p  -  xQ)  e"t/p(1~p) 


l.  P(X(t,xQ)  =  1)  »  1  -  P(X(t,xQ)  =  0) 

and  p={l+  1  ^  G ( 0 )  ]  >  1.  The  (4  4J  obtained  by  the  method  in 

§2.3.1.  Note  that  A  -*+»  by  (4.3)  and  hence  L  =a_Le  does  not  have  a 

w  £  £ 

finite  limit.  In  this  case,  it  is  not  necessary  to  put  some  specific 

relation  between  a„  and  0„. 

8  8 


For  simplicity,  only  the  case  of  symmetric  mutation  (r=2)  will  be 
considered  in  the  following. 


(2)  Deterministic  genic  selection,  strong  selection  -  weak  mutation 
limit  (aB-*+»,  0g-*O,  c=0,  g(x)  =  l) 

In  this  case,  we  assume  that 

(4.5)  (log  0g  )/«e  -*  0  as  8  *♦  0 . 
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Note  that  (4.5)  holds  If  a„e  =C,  0<C<+®. 

o  8 

f1 

(4.6)  s(x)  =  j 


x 


Let  (s(x),m(x))  be 
(0  <  x  *  1) 

(otherwise) 


(4.7)  m(x) 


r  0 

k  1 


(x  <  1) 

(x  i  1)  . 


Under  the  condition  of  (4.5),  se(x)-*s(x)  for  every  x,  me(x)-*m(x)  for 
every  x  and 

2a 

(4.8)  Ae  e  s/2ae0e, 

as  e-*0.  Note  that  Q=(-®,+“),  J(s)  =  {0},  J(m)  =  {l)  and  J  ( s  )nJ  (m)r>Q=0. 
By  Theorem  2.2,  {Xe(t,xQ)}  converges  to  X(t,xQ)  in  the  sense  of  the 
finite  dimensional  distributions,  where 


(4.9)  P(X(t,x0)  =  1)  =  1 

and  t>0 . 


(3)  Deterministic  overdominance,  strong  selection  -  weak  mutation 
limit  (ae-*+®,  0e-*O,  c=0,  g(x)  =  l-2x) 

In  this  case,  we  assume  (4.5).  Let  (s(x),m(x))  be 


(4.10)  s(x) 


1/2 


x 


(0  <  x  <  1) 

(otherwise) 


(4.11)  m(x) 


r  0 

.  1 


(x  <  1/2) 

(x  ^  1/2 )  . 
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Under  the  condition  of  (4.5),  s£(x)-*s(x)  for  every  x,  me(x)-*m(x)  for 
every  x*l/2  and 

__  20-3/2  a_/2 

(4.12)  4/271  o_  e  e  e 

as  e-*0.  Note  that  Q=(-00  .+00  ),  J(s)  =  {0,l),  J(m)  =  {l/2)  and  J ( s )nJ (m)nQ=0. 
By  Theorem  2.2,  (XB(t,x0)}  converges  to  X(t,xQ)  in  the  sense  of  the 
finite  dimensional  distributions,  where 

(4.13)  P(X(t,xQ)  =  1/2)  =  1 
and  t>0 . 


(4^  Stochastic  selection,  weak  selection  -  weak  mutation  limit 
(ae-*0,  0E-*O,  c=l ,  0il) 

Let  (s(x) ,ra(x) )  be 

(4.14)  s(x)  =  x 


(4.15)  m(x) 


'  0 
1/2 
.  1 


(X  <  0) 

(0  £  X  <  1) 
(X  i  1) 


Then,  s£(x)-*s(x)  for  every  x,  ra£  (x)-»ra(x)  for  every  x*  0  and 
(4.16)  a£  -  2/ee2 

as  e-»0.  Note  that  Q=(-»,+w),  J(s)=0,  J(m)  =  {0,l}  and  J  ( s)nJ  (m)r»Q=0. 
By  Theorem  2.2,  {Xg(t,x0)}  converges  to  a  Markov  jump  process  X(t,xQ) 
in  the  sense  of  the  finite  dimensional  distributions,  where 


(4.17) 


'  P(X(t ,xQ)  =  0)  =  {1  +  (1  -  2x0)e  4t }/2 
.  P(X(t,xQ)  =  1)  =  1  -  P(X(t,xQ)  =  0) 


and  t>0 . 
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(5)  Stochastic  selection,  moderate  selection  -  weak  mutation  limit 


(ag-*a;  0<a<+»,  0g-*O,  c=l,  0il) 
Let  (s(x) ,m(x) )  be 


(4.18)  s(x) 


f  (ay(l  -  y)  +  1}  edy/f  (oy(l  -  y) 
d  0  J  0 


+  l}'edy 


(Oix^l ) 


x 


(otherwise) 


(4.19)  m(x) 


r  0 
1/2 
,  1 


(x  <  0) 

(0  £  X  <  1) 
(x  i  1  )  . 


Then,  s  (x)-»s(x)  for  every  x,  m  (x)-*ra(x)  for  every  x*0  and 

B  B 

(4.20)  Ae  2j^{ay(l  -  y)  +  l}~edy/0e 

as  e-*0.  Note  that  Q=(-“,+“),  J(s)=0,  J(m)  =  {0,l}  and  J ( s )nJ (ra)nQ=0. 
By  Theorem  2.2,  (Xe(t,x0)}  converges  to  a  Markov  jump  process  X(t,xQ) 
In  the  sense  of  the  finite  dimensional  distributions,  where 


(4.21) 


r  P(X(t,xQ)  =  0)  =  {1  +  (1  -  2xq) e~4t }/2 
.  P(X(t,xQ)  =  1)  =  1  -  P(X(t,xQ)  =  0) 


and  t>0 . 


(6)  Stochastic  selection.  SAS-CFF  model,  strong  selection  -  weak 
mutation  limit  (a_-*+°°,  0 _-*0 ,  c=l,  0>1) 

B  B 

In  this  case,  we  assume  that 
(4.22)  (log  0g ) / ( log  o(g )  -»  0  as  e  -*  0. 

Note  that  (4.22)  is  equivalent  to 
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(4.23)  a  -»  +oo  as  e  -*  0,  for  every  b  >  0. 

£  8 

The  condition  (4.23),  hence  (4.22)  holds  if  0  log  a  =C,  0<C<+®.  Let 

8  G 

(s(x) ,m(x) )  be 

r  1/2  (0  <  x  <  1) 

(4.24)  s (x)  = 

.  x  (otherwise) 


(4.25)  m(x) 


(x  s  0) 

y)}e~2dy  /  f  {y ( 1  -  y)}e_2dy  (O^xsil) 

J  0 

(x  *  1). 


Under  the  condition  of  (4.22),  s  (x)-*s(x)  for  every  x,  ra  (x)-*m(x)  for 

G  8 

every  x  and 

(4.26)  a6  -  j-4-t  «ee_2To{y( 1  _  y)>e_2dy 

as  e-*0.  Note  that  Q=(-oo,+oo),  J(s)  =  {0,l},  J(m)=0  and  J  (s)nJ  (m)nQ=0. 
By  Theorem  2.2,  {XE(t,xQ)}  converges  to  X(t,xQ)  in  the  sense  of  the 
finite  dimensional  distributions,  where 


(4.27)  P(X(t,xQ)  e  A)  =  J  dm(y) 

A 


Ac [0,1]  and  t>0 . 


(7)  Stochastic  selection.  TIM  model,  strong  selection  -  weak  mutation 
limit  (a_-*+oo,  0  -*0,  c=l,  0  =  1) 

G  G 

In  this  case,  we  assume  that 

(4.28)  0_  log  a.  -»  0  as  8  -*  0 . 

8  8 

The  condition  (4.28)  holds  if  a_0_  =  C,  0<C<+«>.  Let  (s(x),m(x))  be 

8  8 
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(4.29) 


(4.30) 


1/2 

(0  <  x  <  1) 

X 

(otherwise) 

0 

(x  <  0) 

1/2 

(0  £  X  <  1) 

1 

(x  *  1)  . 

Under  the  condition  of  (4.28),  s  (x)-*s(x)  for  every  x,  m  (x)-»m(x)  for 

w  C 

every  xs*0 ,  3L(x)-»m(x)  for  xeRsJ(m),  (2.10)  holds  and 
& 

4  log  a 

(4-31) 

as  e-*0.  Note  that  Q=(-oo,+oo),  J(s)=J(m)  =  {0, 1 }  and  J ( s )nJ (m)nQ= { 0 , 1 } . 
By  Theorem  2.2,  (X^t.Xg)}  converges  to  a  Markov  jump  process  X(t,xQ) 
in  the  sense  of  the  finite  dimensional  distributions,  where 


(4.32) 


'  P((X(t,x0)  =  0)  =  (1  +  (1  -  2x0)e"4t}/2 

.  P(X(t,xQ)  =  1)  =  1  -  P(X(t,xQ)  =  0) 


and  t>0. 


5.  Other  time  scalings  for  population  genetics  models 

In  the  case  of  strong  selection  -  weak  mutation  limit  for  the 
deterninistic  genic  or  overdominance  selection  (the  case  (2)  and  (3)  in 
the  previous  section),  the  limit  process  X(t)  is  trivial  since  this 
process  is  traped  at  some  point  instanteniously.  The  conjecture  of 
Gillespie,  however,  imlpies  that  the  limit  process  stays  at  boundary 
points  with  finite  holding  time  then  jumps  to  some  trap  point.  In  this 
section,  we  will  obtain  the  limit  processes  with  finite  holding  time  at 
the  boundary.  The  reason  why  the  limit  prcesses  in  the  previous  section 
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move  to  the  trap  point  instantinuously  is  that  the  time  scaling 
a8-i!(0.1)I®(0.1)  tends  to  infinity  too  fast  as  e-*0.  In  this  section, 
we  will  introduce  a  time  scaling  t  that  tends  to  infinity  but  slower 

w 

than  Ag. 

Let  xe(t)  be  the  ODDP  in  the  previous  section.  We  define  a  new 

time  scaling  r  =r_(5)  by 
6  & 

(5.1)  TS  =  Te(s)  = 

for  0<5£l.  The  time  scaled  process  for  xe(t)  by  t  is 

w 

(5.2)  Yg(t)  =  xe(xet). 


The  infinitesimal  generator,  the  scale  function  and  the  speed  measure  are 


denoted  by 
(5.3)  Le 


L  ,  s  (x)  and  m  (x)  again.  Then 

&  O  w 


=  TeL6  ‘  Te{2ae(x)d^2  +  be(x)dx} 


'  I®(0,x)/I?(0,l)  (0  £  x  £  1) 

(5.4)  s£(x)  =  ■ 

„  x  (otherwise) 


'  0  (x  a:  0) 

(5.5)  mg(x)  =  I®  (0 , x)/I® (0,5)  (0  £  x  £  1) 

.  I®(0,1)/I® (0,5)  (x  s  1) 

We  have  the  following  results.  We  consider  the  case  of  symmetric 
mutation  (r=2)  for  simplicity. 


(8)  Deterministic  genic  selection,  strong  selection  -  weak  mutation 
limit  (a_-*+co ,  8_-*0 ,  c=0,  g(x)  =  l) 

&  C 

In  this  case,  we  fix  a  5  (0<5<1)  and  assume  that 

log  0 

(5.6)  lira  — - — —  £  -2. 

£10  “e 
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2a 

The  condition  (5.6)  holds  if  0  e  e=C,  0<C<+».  Let  (s(x),m(x))  be 

w 


(5.7)  s ( x ) 


(  1 


^  x 


(0  <  x  i.  1 ) 


(otherwise) 


(5.8)  m(x) 


r  0 
1 

t  +00 


(X  <  0) 

(0  £  X  <  1) 
(X  i  1 )  . 


Under  the  condition  of  (5.6),  s  (x)-*s(x)  for  every  x,  m_(x)-*m(x)  for 

O  O 

every  x;*0,  m  (x)->  ro(x)  for  xeR\J(ro),  (2.10)  holds  and 

w 

20.-1 

(5.9)  x£  =  r£  (5 )  -x  a£  e  /20e 

as  e-*0.  Note  that  Q=(-“,l],  J(s)  =  {0},  J(m)  =  {0,l}  and  J ( s )nJ (ra)nQ={0 } . 
By  Theorem  2.2,  {Y£(t,Xg)}  converges  to  a  Markov  jump  process  Y(t,xQ) 
in  the  sense  of  the  finite  dimensional  distributions,  where  Ye(t,xQ) 
(resp.  Y(  t ,  xn ) )  is  the  process  Y  (t)  (resp.  Y(t))  with  Y.(0)=x.  (resp 

U  B  B  U 

Y(t)=xQ)  and 


(5.10) 


r  P ( Y( t , 0 )  =  0)  =  e  t 
.  P( Y( t , 0 )  =  1)  =  1  -  e_t 


(5.11)  P(Y(t,x0)  =  1)  =  1  (0  <  xQ  i  1) 

for  t>0. 


(9)  Deterministic  overdominance,  strong  selection  -  weak  mutation  limit 

(«e-*+®,  0g-*O ,  c=0 ,  g(x)  =  l-2x) 

In  this  case,  we  fix  a  6  (0<6<l/2)  and  assume  that 

log  0  a  /2  _ 

(5.12)  lim  — - — -  i  -1/8  and  0.  e  B  //cTT  -*  +»  as  e  -*  0 . 
eao  “e  e  s 
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Note  that  the  first  condition  in  (5.12)  is  equivalent  to 


(5.13)  0  e  e  -*  0  as  e  -*  0  for  every  b  <  1/8, 

5 

-a-/8 

and  (5.12)  holds  if  0£=Caee  ,  0<C<+°°.  Consider  Xq  with  0£Xq<1/2 

for  a  while.  Let  (s(x),m(x))  be 


(5.14)  s(x)  = 


1/2 


(0  <  x  <  1) 

(otherwise) 


(5.15)  m(x) 


0 

1 

+  00 


(x  <  0) 

(0  £  x  <  1/2) 
(x  *  1/2) 


Under  the  condition  of  (5.12),  s  (x)-*s(x)  for  every  x,  m  (x)-»ra(x)  for 

D  5 

every  x;*0 ,  rae  (x)-»m(x)  ,  (2.10)  holds  and 
20-1 

(5.16)  xE  -  ae  /fle 

as  e-*0.  Note  that  Q=(-«,l/2],  J(s)  =  {0,l},  J(m)  =  {0 , 1/2 }  and 
J(s)nJ(m)nQ={0} .  By  Theorem  2.2,  (Y^t.Xp)}  converges  to  a  Markov  jump 
process  Y(t,xQ)  in  the  sense  of  the  convergence  of  the  finite  dimensional 
distributions,  where 


(5.17) 


'  P  ( Y(  t ,  0 )  =  0)  =  e 
.  P ( Y( t , 0 )  €  [1/2,1])  =  1  -  e’2t 


(5.18)  P(Y(t,x0)  e  [1/2,1])  =1  (0  <  xQ  <  1/2) 

for  t>0.  Now,  we  consider  Xq  with  O^Xq^I .  By  the  symmetry 
at  x=l/2,  (Ye(t,x0)}  converges  to  Y(t,xQ)  in  the  sense  of  the 
finite  dimensional  distributions,  where 
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(5.19) 


(5.20) 


(5.21) 


'  r(Y(t,0)  =  0)  =  e 
.  P( Y( t , 0 )  =  1/2)  =  1  -  e‘2t 

P(Y(t , xQ)  =  1/2)  =1  (0  <  xQ  <  1) 

r  P ( Y( t , 1 )  =  1)  =  e~2t 
.  P( Y( t , 1 )  =  1/2)  =  1  -  e-2t 


for  t>0. 


6.  Discussion 

The  natural  candidate  for  the  time  scaling  is  that  of  (3.5)  with  which 

the  normalization  s  (0)  =  m  (0)  =  0  and  s  (1)  =  m  (1)  =  1  holds.  This 

£  £  £  £ 

time  scaling  is  used  in  §4.  First,  we  consider  the  weak  selection  -  weak 
mutation  limit  (see  the  case  (1)  and  the  case  (4)).  In  this  case,  random 
sampling  drift  due  to  the  finite  population  size  predominates  over  the 
effect  of  selection  and  mutation.  The  limits  are  Markov  chains  with  state 
space  {0,1},  that  is,  the  raonomorphic  states  due  to  random  sampling  drift. 
In  the  moderate  selection  -  weak  mutation  limit,  the  results  are  very 
similar  to  those  of  weak  selection  -  weak  mutation  limit  (see  the  case  (1) 
and  the  case  (5)).  The  reason  is  that  the  effect  of  mutation  is  very  weak 
in  these  cases  and  random  sampling  drift  predominates  over  selection  in 
the  neighborhood  of  0  and  1.  The  effect  of  moderate  selection  can  be  seen 
only  on  the  time  scaling  (4.3),  (4.20)  and  p  in  (4.4). 

Next,  we  consider  the  strong  selection  -  weak  mutatiom  limit  which 
corresponds  to  the  conjecture  of  Gillespie.  First,  we  consider  the 
stochastic  selection  models.  In  the  SAS-CFF  model  (see  the  case  (6)),  the 
limit  is  the  stationary  distribution  for  the  pure  stochastic  selection 
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model  without  random  sampling  drift  and  mutation.  This  result  is 
consistent  with  the  conjecture  of  Gillespie  (see  §1).  Since  there  is  a 
unique  stationary  distribution  in  this  case,  the  state  space  of  the  limit 
is  stationary  distribution  valued  but  with  no  jumps.  In  the  TIM  model 
(see  the  case  (7)),  the  limit  is  a  Markov  chain  with  the  state  space  {0,1}. 
This  result  is  consistent  with  the  following  fact.  In  the  pure  TIM  model 
without  random  sampling  drift  and  mutation,  there  is  no  non-trivial 
stationary  distributions  and  the  quasi  fixation  will  occur  as  t  -*  +».  The 
limit  process  jumps  between  two  trivial  stationary  points  0  and  1.  The 
results  on  deterministic  selection  models,  however,  are  not  consistent 
with  the  conjecture  of  Gillespie  (see  the  case  (2)  and  (3)).  In  the 
limit,  these  processes  go  to  some  trapped  point  instanteniously .  This 
trivial  results  are  due  to  the  incorrect  choise  of  the  time  scaling  in 
these  cases.  In  §5,  we  have  introduced  an  appropriate  time  scaling  for 
which  non-trivial  results  are  obtained  (see  the  case  (8)  and  (9)).  The 
results  for  the  case  (8)  and  (9)  are  very  consistent  with  the  conjecture. 
The  state  space  of  the  limit  is  stationary  points  for  the  ordinary 
differential  equation  that  describes  the  pure  deterministic  selection 
model  without  random  sampling  drift  and  mutation.  The  limit  process  goes 
to  the  unique  stable  stationary  point  for  the  ordinary  differential 
equation  as  t  ■*  +». 

As  we  have  seen  in  the  above,  we  have  proved  the  conjecture  of 
Gillespie.  The  convergence  is  that  of  the  sense  of  the  finite  dimensional 
distributions.  We  have  identifed  the  limit  processes,  the  appropriate 
time  scaling  and  the  relation  between  ex  and  0  .  We  have  applied  the 

w  w 

notion  of  bi-generalized  diffusion  processes  and  their  limit  theorems 
systematically.  Our  method  is  effective  not  only  for  the  present  problem 
but  also  for  many  models  that  are  described  by  diffusion  equation  (for 
examlpe,  a  quantitative  genetics  model  in  Newman  et  al .  (1985))  as  far  as 
the  model  is  one-dimensional.  For  the  multidimensional  cases,  our  method 
is  not  applicable  since  the  notion  of  bi-generalized  diffusion  processes 
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is  just  for  one-dimensional  stochastic  processes  and  the  proof  of  the 
conjecture  of  Gillespie  for  the  multidimensional  cases  is  an  open  problem. 
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